Trade-off between cost and goodput in wireless: 
Replacing transmitters with coding 
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Abstract — We study the cost of Improving the goodput, or the 
useful data rate, to user in a wireless network. We measure 
the cost in terms of number of base stations, which is highly 
correlated to the energy cost as well as capital and operational 
costs of a network provider. We show that increasing the available 
bandwidth, or throughput, may not necessarily lead to increase 
in goodput, particularly in lossy wireless networks in which 
TCP does not perform well. As a result, much of the resources 
dedicated to the user may not translate to high goodput, resulting 
in an inefficient use of the network resources. We show that using 
protocols such as TCP/NC, which are more resilient to erasures 
and failures in the network, may lead to a goodput commensurate 
the throughput dedicated to each user. By increasing goodput, 
users' transactions are completed faster; thus, the resources 
dedicated to these users can be released to serve other requests or 
transactions. Consequently, we show that translating efficiently 
throughput to goodput may bring forth better connection to users 
while reducing the cost for the network providers. 

I. Introduction 

Mobile data traffic has been growing at an alarming rate 
with some estimating that it will increase more than 25-folds 
in the next five years Hj. In order to meet such growth, 
there has been an increasing effort to install and upgrade 
the current networks. As shown in Figure [T] mobile service 
providers often install more infrastructure (e.g. more base 
stations) in areas which akeady have full coverage. The new 
infrastructure is to provide more bandwidth, which would lead 
to higher quality of experience to users. However, this increase 
in bandwidth comes at a significant energy cost as each base 
station has been shown to use 2-3 kilowatts (kW) ||2l. The 
sustainability and the feasibility of such rapid development 
have been brought to question as several trends indicate that 
the technology efficiency improvements may not be able to 
keep pace with the traffic growth JJ). 

We show that maintaining or even improving users' quality 
of experience may be achieved without installing more base 
stations. In some cases, we show that the users' quality of 
experience may be improved while reducing the number of 
base stations. We measure users' quaUty of experience using 
the throughput perceived by the user or the application, i.e. 
goodput. We make a clear distinction between the terms 
goodput and throughput, where goodput is the number of 
useful bits over unit time received by the user and throughput is 
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Fig. 1: As number of users in a given area grows, a service 
provider may add additional base stations not for coverage but 
for bandwidth. As red users join the network, a second base 
station may be necessary; as green users join the network, a 
third base station may become necessary in order to maintain 
a certain level of quality of service. 

the number of bits transmitted by the base station per unit time. 
In essence, throughput is indicative of the bandwidth/resources 
provisioned by the service providers; while goodput is indica- 
tive of the user's quality of experience. For example, the base 
station, taking into account the error correction codes, may 
be transmitting bits at 10 megabits per second (Mbps), i.e. 
throughput is 10 Mbps. However, the user may only receive 
useful information at 5 Mbps, i.e. goodput is 5 Mbps. 

There can be a significant disparity between throughput 
and goodput, particularly in lossy networks using TCP. TCP 
often mistakes random erasures as congestion ||3], ||4]. For 
example, 1-3% packet loss rate is sufficient to harm TCP's 
performance ||3]-|l6]. This performance degradation can lead 
to inefficient use of network resources and incur substantially 
higher cost to maintain the same goodput. There has been 
extensive research to combat these harmful effects of erasures 
and failures; however, TCP even with modifications does not 
achieve significant improvement. References ||4], Q give an 
overview of various TCP versions over wireless links. 

This disparity between throughput and goodput can be 
reduced by using a transport protocol that is more resilient 
to losses. One method is to use multiple base stations simul- 
taneously (using multiple TCP connections [81 or multipath 
TPC |]9l). However, the management of the multiple streams 
or paths may be difficult, especially in lossy networks. Further- 
more, each path or TCP stream still suffer from performance 
degradation in lossy environments ||8l, ||9l. 

We propose TCP/NC ID, ifTOl as such an alternate transport 
protocol. TCP/NC uses network coding, and modifies TCP's 
acknowledgment (ACK) scheme such that random erasures do 
not affect the transport layer's performance. TCP/NC receiver 
acknowledges degrees of freedom instead of individual packets 
as shown in Figure |2] Once enough degrees of freedoms are 
received at the receiver, the decoder solves the set of linear 
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Fig. 2: Example of TCP and TCP/NC. In the case of TCP, the 
TCP sender receives dupUcate ACKs for packet pi, which 
may wrongly indicate congestion. However, for TCP/NC, the 
TCP sender receives ACKs for packets pi and P2; thus, the 
TCP sender perceives a longer RTT but does not mistake the 
loss to be congestion. 

equations to decode the original data. TCP/NC may not be 
the only viable solution, and other transport protocols that 
can combat erasures may be used. We use TCP/NC for its 
effectiveness and simplicity. 

TCP/NC allows a better use of the base stations installed, 
and can improve the goodput without any additional base 
stations. Improving the goodput with the same or a fewer 
number of base stations implies reduction in energy cost, 
operational expenses, capital expenses, and maintenance cost 
for the network provider. The results in this paper can also 
be understood as being able to serve more users or traffic 
growth with the same number of base stations. This may lead 
to significant cost savings, and may be of interest for further 
investigation. 

II. Model 

Consider a network with n users. We assume that these n 
users are in an area such that a single base station can cover 
them as shown in Figure [T] If the users are far apart enough 
that a single base station cannot cover the area, then more 
base stations are necessary; however, we do not consider the 
problem of coverage. 

The network provider's goal is to provide a fair service to 
any user that wishes to start a transaction. Here, by fair, we 
mean that every user is expected to receive the same average 
throughput, denoted as rt Mbps. The network provider wishes 
to have enough network resources, measured in number of 
base stations, so that any user that wishes to start a transaction 
is able to join the network immediately and achieve an average 
throughput of rt Mbps. We denote rg to be the goodput 
experienced by the user Note that Vg < rt- 

We denote ni,s to be the number of base stations needed to 
meet the network provider's goal. We assume that every base 
station can support at most -Rmax Mbps (in throughput) and 
at most iVmax active users simultaneously. In this paper, we 
assume that i?niax = 300 Mbps and A^max = 200. 

A user is active if the user is currently downloading a 
file; idle otherwise. A user decides to initiate a transaction 
with probability p at each time slot. Once a user decides to 
initiate a transaction, a file size of / bits is chosen randomly 
according to a probability distribution Pf. We denote /i/ to 
be the expected file size, and the expected duration of the 



transaction to be A = ^//''g seconds. If the user is already 
active, then the new transaction is added to the user's queue. 
If the user has initiated k transactions, the model of adding the 
jobs into the user's queue is equivalent to splitting the goodput 
Vg to k transactions (each transaction achieves a rate of Tg/k 
Mbps). 

We denote pp to be the probability of packet loss in the 
network, and RTT to be the round-trip time. In a wireless, pp 
and RTT may vary widely. For example, wireless connection 
over WiFi may have RTT ranging from tens of milliseconds 
to hundreds of milliseconds with loss rates typically ranging 
from 0-10%. In a more managed network (such as cellular 
networks), RTT are typically higher than that of a WiFi 
network but lower in loss rates. 

III. Analysis of the Number of Base Stations 

We analyze nt,, needed to support n users given throughput 
rt and goodput rg. We first analyze P{A,p), the probability 
that a user is active at any given point in time. Given P{A,p), 
we compute the expected number of active users at any given 
point in time and n^s needed to support these active users. 

Consider a user u at time t. There are many scenarios in 
which u would be active at t. User u may initiate a transaction 
at precisely time t with probability p. Otherwise, u is still in 
the middle of a transaction initiated previously. 

To derive P(A,p), we use the Little's Law. For a stable 
system, the Little's Law states that the average number of jobs 
(or transactions in our case) in the user's queue is equal to the 
product of the arrival rate p and the average transaction time A. 
When Ap > 1, we expect the user's queue to have on average 
at least one transaction in the long run. This implies that the 
user is expected to be active at all times. When Ap < 1, 
we can interpret the result from Little's Law to represent the 
probability that a user is active. For example, if Ap ~ 0.3, 
the user's queue is expected to have 0.3 transactions at any 
given point in time. This can be understood as the user being 
active for 0.3 fraction of the time. Note that when the system is 
unstable, the long term average number of uncompleted jobs 
in the user's queue may grow unboundedly. In an unstable 
system, we assume that in the long term, a user is active with 
probability equal to one. 

Therefore, we can state the following result for P{A,p). 



P{A,p) = min{l, Ap} = min <{ 1, -^ • p 



(1) 



Given P{A,p), the expected number of active users is 
nP{A,p). We can now characterize the expected number of 
base stations needed as 



ribs 



nP{A,p) ■ max 



rt 



1 



Rn 



N„ 



(2) 



In Equation (|2]i, max{- 



inax . 

-jj — ,^ — } represents the amount 
of base stations' resources (the maximum load i?max or the 
amount of activity A^max) each active user consumes. The 
value of ribs from Equation dU may be fractional, indicating 
that actually [ribsl base stations are needed. 
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Fig. 3: The values of Ubs from Equation dU with n 
and varying p and r. 
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Note the efr'ect of rt and Vg. As shown in Equation ^, 
increasing rt incurs higher cost while increasing r^ reduces the 
cost. Therefore, when a network provider dedicates resources 
to increase rt, the goal of the network provider is to increase 
Tg proportional to rt- 

IV. Best Case Scenario 

In an ideal scenario, the user should see a goodput rg = rt- 
In this section, we analyze this best case scnario with r ^ 
rt = rg. Once we understand the optimal scenario, we then 
consider the behavior of TCP and TCP/NC in Section |V] 

A. Analytical Results 

In Figures |3a] and [3b] we plot Equation dU with Hf = 3.2 
MB and yU/ = 5.08 MB for varying values of p. As r increases, 
it does not necessarily lead to increase in ribs- Higher r results 
in users finishing their transactions faster, which in turn allows 
the resources dedicated to these users to be released to serve 
other requests or transactions. As a result, counter-intuitively, 
we may be able to maintain a higher r with the same or a 
fewer number of base stations than we would have needed for 
a lower r. For example, in Figure |3a] when r < 1 Mbps, the 
rate of new requests exceeds the rate at which the requests 
are handled; resulting in an unstable system. As a result, most 
users are active all the time, and the system needs ^" = 
-^jIj^ = 5 base stations. 

There are many cases where nts is relatively constant 
regardless of r. For instance, consider p = 0.03 in Figure l3b] 
The value of ni,s is approximately 4-5 throughout. However, 
there is a significant difference in the way the resources 
are used. When r is low, all users have slow connections; 
therefore, the base stations are fully occupied not in throughput 
but in the number of active users. On the other hand, when r is 
high, the base stations are being used at full-capacity in terms 
of throughput. As a result, although the system requires the 
same number of base stations, users experience better quality 
of service and users' requests are completed quickly. 

When p and r are high enough, it is necessary to increase 
nt,s- As demand exceeds the network capacity, it becomes 
necessary to add more infrastructure to meet the growth in 
demand. For example, consider p = 0.04 in Figure |3b] In this 
case, as r increases n^s increases. 
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Fig. 4: Average value of rn,s over 100 iterations with n = 1000 
and varying p and r. 

B. Simulation Results 

We present MATLAB simulation results to verify our analy- 
sis results in Section lTV-AI We assume that at every 0. 1 second, 
a user may start a new transaction with probability ^. This 
was done to give a finer granularity in the simulations; the 
results from this setup is equivalent to having users start a 
new transaction with probability p every second. We assume 
that there are n ~ 1000 users. For each iteration, we simulate 
the network for 1000 seconds. Each plot is averaged over 100 
iterations. 

Once a user decides to start a transaction, a file size is 
chosen randomly in the following manner. We assume there 
are four types of files: fdoc = 8KB (a document), fimage 
= 1MB (an image), f,np3 = 3 MB (a mp3 file), /„ideo = 
20 MB (a small video), and are chosen with probability 
Pdoc, Pimage, Pmp3, and Pyideo, respectively. In Figure ga] 

we set [pdoc, Pimage, Pmp3,Pvideo] = [0.3,0.3,0.3,0.1]. This 

results in /i/ = 3.2 MB as in Figure [3a] In Figure |4b] we 

set [pdoc,Pirnage,Prnp3,Pvideo] = [0.26, 0.27, 0.27, 0.2], which 

gives Hf = 5.08 MB as in Figure [3b] 

The simulation results show close concordance to our anal- 
ysis. Note that the values in Figures [4a] and [4b] are slightly 
greater than that of Figures [3a] and [3b] This is because, in the 
simulation, we round-up any fractional n(,s's since the number 
of base stations needs to be integral. 

V. Analysis for TCP/NC and TCP 

We now study the effect of TCP and TCP/NC's behavior. 
We use the model and analysis from f5]| to model the rela- 
tionship between rg and pp for TCP and TCP/NC. We denote 



-nc to be the goodput when using TCP/NC, and r, 



g-tcp 



to be 



that for TCP. We set the maximum congestion window, M^max, 
of TCP and TCP/NC to be 50 packets (with each packet being 
1000 bytes long), and their initial window size to be 1. We 
consider RTT = 100 ms and varying pp from 0% to 5%. We 
note that, given rt and pp, rg < rt{l ~ Pp) regardless of the 
protocol used. 

In Q, QUI, TCP/NC has been shown to be robust against 
erasures; thus, allowing it to maintain a high throughput 
despite random losses. For example, if the network allows 
for 2 Mbps per user and there is 10% loss rate, then the user 
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Fig. 5: The value of rg-nc and Vg-tcp against rt for varying 



should see approximately 2-(l — 0.1) = 1.8 Mbps. Reference 
||5| has shown, both analytically and with simulations, that 
TCP/NC indeed is able to achieve goodput close to 1.8 Mbps 
in such a scenario while TCP fails to do so. 

A. Behavior of rg^nc with varying pp 

Equation (20) from |5| provides the goodput behavior of 
TCP/NC, which we provide below in Equation (|3]l. 

1 /.„. (W^^ax-l)' + (VK,,ax-l)^ 



tSRTT 



tWn, 



restricted by WVax- However, TCP achieves this maximal 
goodput only when pp = 0%. This is because, when there 
are losses in the network, TCP is unable to recover effectively 
from the erasures and fails to use the bandwidth dedicated to 
it. For Pp > 0%, Tg^tcp is not limited by W,„ax but by TCP's 
performance limitations in lossy wireless networks. 

C. The Number of Base Stations for TCP/NC and TCP 

We use the values of Vg^nc and Vg^tcp from Sections IV-AI 
and IV-BI to compare the number of base stations for TCP/NC 
and TCP using Equation ©. We assume that SRTT = RTT. 
In general, SRTT is slightly larger than RTT. 

Figures |6] and |7] show n^s predicted by Equation (|2]i when 
RTT = 100 ms. TCP suffers performance degradation as 
Pp increases; thus, ribs increases rapidly with pp. Note that 
increasing rt without being able to increase rg leads to 
inefficient use of the network, and this is clearly shown by 
the performance of TCP as rt increases with pp > 0%. 

However, for TCP/NC, n^s does not increase significantly 
(if any at all) when pp increases. As discussed in Section 
Hill TCP/NC is able to translate better rt into rg^nc despite 
Pp > 0%, i.e. rt w rg^nc As a result, this leads to a significant 
reduction in rif,, for TCP/NC compared to TCP. Note that n^s 
for TCP/NC is approximately equal to the values of ribs in 
Section [III] regardless of the value of pp. Since TCP/NC is 
, resilient to losses, the behavior of rg-nc does not change as 
(3) dramatically against pp as that of rg-tcp does. As a result, we 



where SRTT is the effective RTT observed by TCP/NC 
and increases with pp and t represents the duration of the 
connection (in number of RTTs). Equation (|3]l shows the effect 
of network coding. The goodput of TCP/NC decreases with pp; 
however, the effect is indirect. As pp increases, the perceived 
RTT increases, which leads to TCP/NC reducing its rate. 

Combining Equation ^ and rg-nc < rtil—pp), we obtain 
the values of rg-nc for various rt, RTT, and pp. In Figure 
l5al the values of rg^nc plateaus once rt exceeds some value. 
This is caused by Wmax- Given Wmax and RTT, TCP/NC and 
TCP both have a maximal goodput it can achieve. In the case 
with RTT =100 ms, the maximal goodput is approximately 
4 Mbps. Note that regardless of pp, all TCP/NC flows achieve 
the maximal achievable rate. This shows that TCP/NC can 
overcome effectively the erasures or errors in the network, 
and provide a goodput that closely matches the throughput rt. 

B. Behavior of rg^tcp with varying pp 

Equation (16) from Q provides the goodput behavior of 
TCP, which we provide below in Equation (JUi. 



Wrr 



' g-tcp 



Pp 



RTT' Pp ^tt{I + ,/II^) 



(4) 



Note that unlike TCP/NC, TCP performance degrades propor- 
tionally to w i. 

Combining Equation ^ and rg-tcp < ?'*(! —Pp), we obtain 
the values of rg^tcp for various rt, RTT, and pp as shown 
in Figure |5b] As in Figure [Sa] the values of rg^tcp are also 



observe nts for TCP/NC to reflect closely the values of Ubs 
seen in Section Hill which is the best case with rt = rg. 

We observe a similar behavior for other values of RTT 
as we did for RTT = 100 ms. The key effect of the value 
of RTT in the maximal achievable goodput. For example, 
if Wmax is limited to 50, the maximal achievable goodput 
is approximately 0.8 Mbps when RTT = 500 ms, which is 
much less than the the 4 Mbps achievable with RTT = 100 
ms. As a result, for RTT = 500 ms, neither rg-nc nor rg-tcp 
can benefit from the increase in rt beyond 0.8 Mbps. Despite 
this limitation, TCP/NC still performs better than TCP when 
losses occur When demand exceeds the maximal achievable 
goodput, nt,s increases for both TCP/NC and TCP in the same 
manner We do not present the results for want of space. 

VI. Conclusions 

In wireless networks, the solution to higher demand is often 
to add more infrastructure. This is indeed necessary if all 
the base stations are at capacity (in terms of throughput). 
However, in many cases, the base stations are "at capacity" 
either because they are transmitting redundant data to recover 
from losses; or because they cannot effectively serve more 
than a few hundred active users. This may be costly as base 
stations are expensive to operate. One way to make sure that 
wireless networks are efficient is to ensure that, whenever base 
stations are added, they are added to effectively increase the 
goodput of the network. 

We studied the number of base stations ribs needed to 
improve the goodput rg to the users. It may seem that higher rg 
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Fig. 6: The value of ribs from Equation © for TCP and TCP/NC with varying p^ and p. Here, RTT = 100 ms, M^max = 50, 
n = 1000, and /i/ = 3.2 MB. In (a), pp = and both TCP and TCP/NC behaves the same; thus, the curves overlap. Note that 
this result is the same as that of Figure |3a] In (b), the value of rn,s with TCP for p ~ 0.03 and 0.04 coincide (upper most red 
curve). In (c) and (d), the values of rn,s with TCP for p > 0.01 overlap. 
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Fig. 7: The value of nbs from Equation © for TCP and TCP/NC with varying pp and p. Here, i^TT = 100 ms, M^max = 50, 
rt = 1000, and ^/ = 5.08 MB. In (a), the results for TCP and TCP/NC are the same. Note that this result is the same as that 
of Figure |3b] In (b) and (c), the value of n\js with TCP for p > 0.01 coincide (upper red curve). In (d), the values of ni,s with 
TCP for any p all overlap. We do not show results for pp = 4% or 5% as they are similar to that of (d). 



necessarily increases ribs- Indeed, if there are enough demand 
(i.e. rg, p, or fif are high enough), we eventually need to 
increase Ubs- However, we show that this relationship is not 
necessarily true. When rg is low, each transaction takes more 
time to complete and each user stays in the system longer This 
degrades the user experience and delays the release of network 
resources dedicated to the user This is particularly important 
as the number of active users each base station can support 
is Umited to the low hundreds. We observed that, given r^, 
achieving low Tg may lead to a significant increase in Ubs and 
an ineffective use of the network resources; while achieving 
high rg may lead to reduction in ribs- 

We showed that, in lossy networks, the goodput rg observed 
may not closely match the amount of resources dedicated to 
the user, e.g. rg <^ rt- This is due to the poor performance 
of TCP in lossy networks. To combat these harmful effects, 
network providers dedicate significant amount of resources, 
e.g. retransmissions and error corrections, to lower the loss 
rates. This, however, results in the base station transmitting 
at high throughput rt but little translating to goodput rg. We 
showed that TCP/NC, which is more resilient to losses than 
TCP, may better translate rt to rg. Therefore, TCP/NC may 
lead to a better use of the available network resources and 
reduce the number of base stations Ubs needed to support users 
at a given rg. 
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